Multi-pitch and periodicity analysis model for sound separation and auditory scene analysis
نویسندگان
چکیده
A model for multi-pitch and periodicity analysis of complex audio signals is presented that is more efficient and practical than the Meddis and O’Mard unitary pitch perception model, yet exhibits very similar behavior. In this paper we also demonstrate how to apply this model to source separation of complex audio signals such as polyphonic and multi-instrumental music and mixtures of simultaneous speakers. Such analysis techniques are important for automatic transcription of music and structural representation of audio signals. (See also: http://www.acoustics.hut.fi/ ̃ttolonen/icassp99/pitchdet/)
منابع مشابه
A computationally efficient multipitch analysis model
A computationally efficient model for multipitch and periodicity analysis of complex audio signals is presented. The model essentially divides the signal into two channels, below and above 1000 Hz, computes a “generalized” autocorrelation of the low-channel signal and of the envelope of the high-channel signal, and sums the autocorrelation functions. The summary autocorrelation function (SACF) ...
متن کاملRNN-BLSTM Based Multi-Pitch Estimation
Multi-pitch estimation is critical in many applications, including computational auditory scene analysis (CASA), speech enhancement/separation and mixed speech analysis; however, despite much effort, it remains a challenging problem. This paper uses the PEFAC algorithm to extract features and proposes the use of recurrent neural networks with bidirectional Long ShortTerm Memory (RNN-BLSTM) to m...
متن کاملCalculating Tonal Fusion by The Generalizes Coincidence Function
1. Background Based on the frequency analysis of the cochlea and the volley principle, as Wever (1949) described it, periodic sound is represented in the auditory system by periodic firing patterns. Cariani & Delgutte (1996) have shown that the distribution of interspike intervals (ISI) between all spikes in the auditory nerve provide a code of acoustical information. A time series analysis of ...
متن کاملA Hybrid Approach for Co-Channel Speech Segregation based on CASA, HMM Multipitch Tracking, and Medium Frame Harmonic Model
This paper proposes a hybrid approach for cochannel speech segregation. HMM (hidden Markov model) is used to track the pitches of 2 talkers. The resulting pitch tracks are then enriched with the prominent pitch. The enriched tracks are correctly grouped using pitch continuity. Medium frame harmonics are used to extract the second pitch for frames with only one pitch deduced using the previous s...
متن کاملA reliability criterion for time-frequency labeling based on periodicity in an auditory scene
Computational Auditory Scene Analysis (CASA) aims to model our ability to structure our acoustical environment. In a CASA context, this paper deals with a method for time-frequency labeling based on harmonic properties. The method is based on a classical pitch extraction algorithm, termed the « zerocrossing method », which is known to be particularly sensitive to any kind of interference. This ...
متن کامل